Unsupervised Learning Using MML
نویسندگان
چکیده
This paper discusses the unsupervised learning problem. An important part of the unsu-pervised learning problem is determining the number of constituent groups (components or classes) which best describes some data. We apply the Minimum Message Length (MML) criterion to the unsupervised learning problem , modifying an earlier such MML application. We give an empirical comparison of criteria prominent in the literature for estimating the number of components in a data set. We conclude that the Minimum Message Length criterion performs better than the alternatives on the data considered here for unsupervised learning tasks.
منابع مشابه
Unsupervised Learning of Gamma Mixture Models Using Minimum Message Length
Mixture modelling or unsupervised classification is a problem of identifying and modelling components in a body of data. Earlier work in mixture modelling using Minimum Message Length (MML) includes the multinomial and Gaussian distributions (Wallace and Boulton, 1968), the von Mises circular and Poisson distributions (Wallace and Dowe, 1994, 2000) and the distribution (Agusta and Dowe, 2002a, ...
متن کاملMML-Based Approach for Finite Dirichlet Mixture Estimation and Selection
This paper proposes an unsupervised algorithm for learning a finite Dirichlet mixture model. An important part of the unsupervised learning problem is determining the number of clusters which best describe the data. We consider here the application of the Minimum Message length (MML) principle to determine the number of clusters. The Model is compared with results obtained by other selection cr...
متن کاملEnhancing MML Clustering Using Context Data with Climate Applications
In Minimum Message Length (MML) clustering (unsupervised classification, mixture modelling) the aim is to infer a set of classes that best explains the observed data items. There are cases where parts of the observed data do not need to be explained by the inferred classes but can be used to improve the inference and resulting predictions. Our main contribution is to provide a simple and flexib...
متن کاملThe MML Evolution of Classi cation
Minimum encoding induction (MML and MDL) is well developed theoretically and is currently being employed in two central areas of investigation in machine learning|namely, classiication learning and the learning of causal networks. MML and MDL ooer important tools for the evaluation of models, but ooer little direct help in the problem of how to conduct the search through the model space. Here w...
متن کاملIntrinsic Classification of Spatially Correlated Data
Intrinsic classification, or unsupervised learning of a classification, was the earliest application of what is now termed minimum message length (MML) or minimum description length (MDL) inference. The MML algorithm ‘Snob’ and its relatives have been used successfully in many domains. These algorithms treat the ‘things’ to be classified as independent random selections from an unknown populati...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996